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CS| i Abstract 

i In the complete graph on n vertices, when each edge has a weight which is an 

I exponential random variable, Frieze proved that the minimum spanning tree has weight 

^ ■ tending to C(3) = 1/1'^ + 1/2^ + 1/3^ + • • • as n —>■ oo. We consider spanning trees 

^ , constrained to have depth bounded by k from a specified root. We prove that if 

' k > log2 logn + a'(l), where is any function going to oo with n, then the minimum 

' bounded-depth spanning tree still has weight tending to C{3) as n ^ oo, and that if 

p I . k < log2 log n, then the weight is doubly-exponentially large in log2 log n — k. It is NP- 

(-H \ hard to find the minimum bounded-depth spanning tree, but when k < log2 log n— 

■ a simple greedy algorithm is asymptotically optimal, and when k > log2 logn + 

g . an algorithm which makes small changes to the minimum (unbounded depth) spanning 
tree is asymptotically optimal. We prove similar results for minimum bounded-depth 

^ ' Steiner trees, where the tree must connect a specified set of m vertices, and may or may 

^ ■ not include other vertices. In particular, when m = const x n, if /c > log2 logn + uj{1), 

OO I the minimum bounded-depth Steiner tree on the complete graph has asymptotically the 

I same weight as the minimum Steiner tree, and if 1 < /c < log2 logn — the weight 

'nI" I tends to (1 — 2~'^)y^8m/n [V2nin/2'^] ^^^"^ in both expectation and probability. The 

■ same results hold for minimum bounded-diameter Steiner trees when the diameter 
. bound is 2k; when the diameter bound is increased from 2k to 2k + 1, the minimum 

Q I Steiner tree weight is reduced by a factor of 2^/^^ 

>x! ■ 1 Introduction 



For graphs with random edge weights, we study minimum spanning trees and Steiner trees 
in which there is a bound on the diameter or else a bound on the depth from a specified 
root vertex. We obtain precise estimates of the weight of the optimal tree, and some of the 
results are surprising. There is a sharp cutoff in the depth/diameter constraint, above which 
the constrained minimum spanning tree has almost the same weight as the unconstrained 
minimum spanning tree, and below which the weight blows up. 



2000 Mathematics Subject Classification. 05C80, 90C27, 05C05; 60C05, 82B26, 68W40, 68R10, 68W25. 

Key words and phrases. Minimum spanning tree, Steiner tree, random graph, bounded-diameter span- 
ning tree, bounded-depth spanning tree, hop constraints, threshold, cutoff phenomenon. 
* University of Toronto. 
^University of British Columbia. 

Microsoft Research. 
^University of Washington. 



1 



Minimum bounded-depth/ diameter spanning and Steiner trees Angel, Flaxman, Sz Wilson 2 



1.1 Definitions 

The minimum spanning tree (MST) of an edge-weighted undirected simple graph G on n 
vertices is the spanning tree which minimizes the sum of the edge weights. The Steiner tree 
problem also specifies a set T of m = |T| < n terminal vertices that are to be connected 
by the tree; the tree may or may not contain the other vertices in the graph G. We denote 
the minimum spanning tree of G by MST(G'), and the minimum Steiner tree by MST(G, T). 
Prim's algorithm and Kruskal's algorithm are two classic efficient algorithms for finding the 
MST(G), but computing the minimum Steiner tree MST(G', T) is well-known to be NP-hard 
[GJ79] . 

The bounded-depth Steiner tree problem, also known as the Steiner tree problem with 
"hop constraints," is an abstraction of several important real-world combinatorial optimiza- 
tion problems, including designing telecommunications networks with a maximum transmis- 
sion delay bound |GM03] and solving lot sizing problems with a limits on the number of 
periods goods can in stock |VoB99] . We denote by MST {G,T) the minimum weight 

depth(r) < k 

Steiner tree of the graph G connecting the terminal vertices T such that each vertex is 
within distance k from the root vertex r. Similarly, MST {G,T) denotes the minimum 

diam< k 

weight Steiner tree of G connecting the vertices in T with diameter bounded by k. The min- 
imum bounded-depth and bounded-diameter spanning trees are of course the special case 
where T is the set of all vertices. 

For a tree T, we let wt(T) denote its weight. 

There has been extensive research (which we describe below) in computer science, math- 
ematics, operations research, and physics on the bounded-diameter and bounded-depth ver- 
sions of these problems. In the bounded-diameter version, the minimization is only over 
trees which satisfy a bound on their diameter (maximum number of edges within the tree 
connecting a pair of vertices), and in the bounded-depth version, the minimization is over 
trees satisfying a bound on the maximum distance from a pre-specified root vertex. (The 
bounded-diameter and bounded-depth versions are closely related.) 

1.2 The MST on random graphs 

There has been a lot of research on the properties of MST's on random graphs. Two of the 
most well-studied ensembles of random graphs are the following: 

1. The vertices of G are the points of a Poisson point process in Euclidean space, with 
the edge weights being the Euclidean distance between the points. The MST for these 
geometric graphs G has been studied in |AB92j |Pen03[ Chapter 13] |CIL"'"07] and other 
articles. 

2. The graph G is the complete graph with the edge weights being i.i.d. copies of a 
random variable, such as an exponential with mean 1, or a uniform number between 
and 1. (It turns out to matter very little which random variate occurs on the edges.) 
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In 1985, Frieze showed that the expected cost of the minimum spanning tree on the 
complete graph with edge weights distributed independently and uniformly between and 1 
tends to a constant as n tends to cxd, and the constant is C(3) = 1/1^ + 1/2^ + 1/3^ + ■ ■ ■ = 
1.202. . . |Fri85] . In our notation, this says E[wt(MST(_K'„))] — * C(3) as n — > oo (we abuse 
notation by making the edge weight distribution implicit in the graph K^^. A concentration 
result was also proven, so the actual weight is with high probability close to C(3) [FriSSj . Even 
more precise results are known: the distribution of wt(MST(i^'„)) converges to a Gaussian 
with mean ^(3) and variance (6^(4) — 4C(3))/n |Jan95l IJWOGj . Since most edges in the 
optimal tree have weight close to 0, so long as the weight random variables have a density 
function that is 1 at weight 0, wt(MST(ir„)) C(3) with high probability |Ste87l[FM89] . 

Regarding the structure of MST(_ft'„), it is known that, with high probability, the diameter 
of MST(ir„) is e(ni/3), and the expected diameter is also Q{n^l^) [ABBROSj . (This is in 
contrast to the uniformly random spanning tree on the complete graph, which has diameter 
0(n^/^) in probability and in expectation |RS67t ISze83] .) From this diameter bound, it 

follows that wtf MST [Kn) \ C(3) and wtf MST [Kn) \ C(3) in probability. 

Mepth< <^(ni/3) / Vdiam< (^(n^/^) / 

We prove that this convergence still holds for a much more restrictive diameter bound or 
depth bound: 

Theorem 1.1. For the complete graph Kn with Exp(l) edge weights, if k = log2 logr;, + co'(l), 
where uj{1) is any quantity that tends to oo, however slowly, then 

wtf MST (Kn)) -^C(3) and wtf MST (K„)) ^ C(3) (1) 

\depth< k / \diam< 2k / 

in both probability and expectation. This is tight in the sense that when k = log2logn — A, 
wtf MST (Kn)) = exp(2^+®(^)) and wtf MST {Kn)) = exp(2^+®(^)). (2) 

\depth< k / \diam< 2k / 

Thus there is a sharp cutoff at depth log2 logn ± 6(1) (diameter 2 log2 logn ± 0(1)). 

Remark 1.2. Any tree of depth k has diameter < 2k. Any tree of diameter 2k is also a 
tree of depth k, rooted at a uniquely defined central vertex. Similarly, any tree of diameter 
2fc + 1 is a tree of depth k rooted at a central edge. Thus, the principal difference between 
MST and MST is that in the first case, the root vertex is pre-specified, and in the second 

depth< k diam< 2k 

case, any vertex may serve as the root. Thus, 

wtf MST {G,m)) < wtf MST {G,m)), (3) 

\diam< 2k / \dcpth< k / 

and there are some parameter values for which the bounded-diameter tree is quite a bit lighter 
than the bounded-depth tree. For example, if m = 2 and /c = 1, we have wt ( MST {Kn, 2) 



\diam< 2 

6(1/1/72) while wt ( MST {Kn, 2) ) = 6(1). But for the parameter values covered by our the- 

Vdcpth< 1 / 

orems, it turns out a posteriori that wtf MST {Kn,m) ) = (1 + o(l)) wtf MST {Kn,m) ]. 

Vdiam<2fc / Vdepth< fc / 



Minimum bounded-depth/ diameter spanning and Steiner trees Angel, Flaxman, Sz Wilson A 



Remark 1.3. This theorem also holds when the edge weights come from other distributions, 
as discussed below. 

Steiner trees in networks with uniformly random edges weights on the complete graph Kn 
were investigated in |BGRS04"] . Since Kn is symmetric, it is only necessary to specify the 
number m of terminals rather than the precise set. There it was shown that when 2 < m < 
o(n), 

771 — 1 77 

E[wt(MST(ir„, m))] = (1 + o{l))- — - log -. 

n m 

When m is of the same order as n, say m = an, the value of E[wt(MST(ii'„, an))] is 6(1), 
but its precise limiting value is not known except when a = 1 (where it is C(3))- We prove 
that this same weight can be achieved when suitably restricting the diameter: 

Theorem 1.4. If k > logg log + co'(n/(m log(en/m))), and the edge weight probability 
distribution has density 1 at 0, then 

wtf MST (K„,m)) wtf MST {Kn,m)) 

\depth< k / , , Vdiam< 2k / _ , , , 

wt(MST(ir„,m)) wt(MST(ir„,m)) ^ ' 

in probability, and if the expected edge weight is finite, convergence holds in expectation too. 

In the case m = n. [Theorem 1.41 yields the first part of [Theorem l.ll but for Steiner trees 
with general m we do not know if there is a sharp cutoff in the same sense that there is 
for the minimum spanning tree, though whenever m = G(n) there is still a sharp cutoff at 
log2logn±e(l). 

The next theorem gives the weight when the depth is smaller than log2 log n. 

Theorem 1.5. If 2 < k < log2 logn — log2 log(en/m) — ti;(l), and the edge weight probability 
distribution has density 1 at 0, then 



Vdiam< 2k 



wtf MST (A-„, ,„)) = (1 - 2-' ± 0(1)) ^ ) (6) 

\diam< 2k + 1 J \ U \ 2'^ j 

in probability, and if the expected edge weight is finite, convergence holds in expectation too. 
(These formulas are valid for k = 1 too, when the weights are Exp(l) random variables.) 

For example, when k = 2 and m = an, [Theorem 1.51 implies that with high probability 
wtf MST {Kn,an)) = {1 + o{l))la^^'^n^^\ 

\depth< 2 / 2 

The second part of [Theorem 1.11 follows from [Theorem 1. "5] upon specializing to the case 
m = n and k = logg logn — A. 
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1.3 Computational intractability and approximation algorithms 

Obtaining optimal trees is computationally intractable in general. The minimum bounded- 
diameter spanning tree problem is NP-hard for any diameter between 4 and n — 2 |GJ79[ 
pg. 206] , and the minimum bounded-depth spanning tree problem is NP-hard even for depth 2 
(this can be shown by a reduction from the facility location problem, see |DGR06j ). Of course 
the bounded-diameter and bounded-depth Steiner tree problems are only harder, so they too 
are NP-hard. In fact, for any fixed diameter > 4, it is NP-hard to even approximate the 
minimum bounded-diameter spanning tree to within an approximation ratio of better than 
O(logn) [BIKPOlj . 

Because of this complexity, numerous algorithms have been investigated, including exact 
(but time-consuming) integer programming formulations |AC92t IAC93t IGROSaj . fast rig- 
orous approximation algorithms [BIKPOll IKP991 IAFHP"'"05] , and heuristic approximation 
algorithms |DGR06l IVojj99l IGou95l iM^ilOTl IGou96[ IGGLOSbl IGGLOSal [RJ031 [JH03| IGR05b| 
IGvHROGl IKopOGj IPutOTl IZauOSl IBBB+OS] . 



These intractability and inapproximability results are of course for worst-case graphs, 
and for random graphs one can do better. One of the heuristic algorithms, based on "survey 
propagation" and "the cavity method," was recently tested on random graphs |BBB"'"08j . 
which led us to investigate the weight of the true optimal minimum bounded-depth and 
bounded-diameter spanning tree and Steiner tree on random graphs. There were also some 
earlier investigations of the minimum bounded-diameter spanning tree on random graphs 
[ADF991 IAD02j which consisted of testing the performance of several other heuristic algo- 
rithms. In the present paper we rigorously analyze the asymptotic weight of these trees. 
Indeed, we also describe two algorithms that approximate well the constrained spanning (or 
Steiner) tree problem. 

1.4 Proof strategy 

In the remaining sections of this paper, we present the proofs of these theorems. We saw 
already that [Theorem 1.11 is implied by Theorems 11.41 and 11.51 

The upper bound in lTheorem l.Sl follows from analyzing (in § 2 ) the Steiner tree produced 



by a simple greedy heuristic algorithm, which estimates how many vertices should occur in 
each level of the tree, and picks the cheapest set of this many vertices connected to the 
previous level of the tree. The proof of the upper bound in [Theorem 1.41 is also algorithmic. 



and appears in |§ 3[ There, the strategy is to start with the minimum unconstrained spanning 
or Steiner tree, delete a small number of edges to break the tree apart into pieces, and 
then splice these pieces back together using the greedy algorithm from [Theorem 1.5[ The 
resulting tree has almost the same weight as the original tree, and it has very small depth. 
Both of these algorithms produce trees rooted at a pre-specified vertex, so they yield both 
the bounded-depth and bounded-diameter upper bounds. 

The lower bounds in lTheorem 1.41 are self-evident, since the weight of the unconstrained 
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minimum Steiner tree is an obvious lower bound on the weight of a constrained Steiner tree. 

and make use of a tight concentration 



The lower bounds in [Theorem 1.51 are proved in 
inequality that is derived in 



The lower bound applies to any Steiner tree (where any 
vertex can be the root in a tree of diameter 2/c, or any edge can be the root in a tree of 
diameter 2fc + 1), so recalling ([3]), the proof yields both the bounded-diameter and bounded- 
depth lower bounds. 

and 



When carrying out the above calculations in § 2, § 3, § 4 



we assume that the 

edge weights are distributed according to exponential random variables with mean 1, since 
this distribution is quite natural and simplifies many of the calculations. In §_6, we show 
how our results for exponential random variables imply the corresponding results for other 
distributions (when the depth bound is bigger than 1). 



2 Greedy tree 
2.1 Construction 

Let us consider the following greedy method for algorithmically growing a low-weight span- 
ning or Steiner tree with bounded depth or diameter. We will build a tree in which every 
vertex is within distance k from a particular root vertex or root edge. (For bounded-diameter 
trees, the root may be chosen arbitrarily.) This greedy tree 7^ depends on a sequence of non- 
negative numbers i = {io,h, ■ ■ ■ such that io = I (if the root is a vertex) or = 2 (if 
the root is an edge), and ^^^j = n if the desired tree is a spanning tree, and = m if 
the desired tree is a Steiner tree. The idea is that the greedy tree Tg will have exactly ii 
vertices at distance i from the root, except that for Steiner trees there may be fewer than 
ik = m vertices at the k^^ level if the terminal vertices were used closer to the root. The 
construction is inductive. Level is the root vertex, or pair of vertices if the root is an 
edge. For convenience, we let Si = io + ■■■ + ii. For i < k, suppose we have chosen the 
vertices at level i — 1 and below. For each of the unchosen n — vertices, we look at the 
lightest edge connecting it to level i — 1, and, to form level i of the tree %, we choose the ii 
of these vertices that have the lightest edges to level i — 1. For level i = k, we connect the 
terminal vertices (that have not already been included in the tree) using their lightest edge 
to level k — 1. 



2.2 Approximate weight 

The choice of the sequence i has great influence on the total weight of the resulting tree %. 
Let wti{%) he the (random) total weight of the edges within the greedy tree connecting 
level 2 — 1 to level i. For each of the n — vertices not in the tree up to level i — 1, 
the weight of the lowest weight edge leading to it from the vertices at level i — 1 is an 
exponential random variable with mean When picking the ii vertices at level i < k, 
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we pick the ii smallest of these random variables. The j**^ smallest has expectation 



"i— 1 



+ 



and so 



E[wti(r,)] 



n - Si_x n - Si_i - 1 



+ 



+ ■■■ + 



n - Si_i - (j - 1) 



+ ■■■ + 



n - Si_i n - 



n - Si_i - {ti - 1) 



In |§ 4| we derive a concentration result for these random variables. 

In the case of spanning trees (m = n), the above formula also holds for level i = k, and 
simplifies to E[wtj(7£)] = For general Steiner trees, at level k we have E[wtj(7£)] < 

ii/ii-i because some of the terminals may have been selected already. 

Thus the expected weight E[wtj(7£)] of the i^^ level may be approximated by 



E[wt,(r,)] 



2n£i_i 



^-1 



mii_i 



if Si <^ n; 

if Si = n, i.e., i = k. 



If Sk-i <^ n then the above approximation holds for all i. Next we choose a good sequence 
that makes E[wt(7£)] = X]t^[*^«(^)] sniall. 



2.3 An optimization problem 

It is convenient to define /„(a, 6) = }p'/{2na) and 

/n,cW = ^1, . . . , 4) = /n(4, 4) + " " " + /n(4-2, 4-l) + c/„(4-l, 4) 

- K TT-^r-^ + „ , 



2n£o 2n4-2 2n4-i 

(a factor of c appears only in the last level). We have argued that the greedy tree with level 
sizes i has expected weight approximately fn,2n/m{^), provided that most of the nodes occur 
in the last level. Next we optimize fn,c{^), which is a deterministic function of i, allowing the 
level sizes to be real numbers rather than constraining them to be integers. We further relax 
the constraints on the sum of the level sizes + ■■■ + £}: = n for MST and the corresponding 
constraint for the Steiner tree), and instead fix £o(= 1) and ik, and optimize the intermediate 



level sizes. In § 2.4 and § 2.5 we return to the question of how well this approximates the 
weight of the greedy tree with constrained integer level sizes and random edge weights. 

For ii-i and ii+i held fixed, let us find the choice of ii which minimizes /„(£). If z + 1 < 
then the two terms of /n(^) involving ii are 

/n(^i-l;^i) +/n(^i;^i+l) = Tj — 7 1" 
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which is minimized when 



_ ^1+1 _ p. 
e2 ~ ^' 



i.e., when fn{.UAi+i) = 2fn{ii-i, li). In the last level, with i = A; — 1 we still have that the 
optimal choice of £i yields fn{ii,ii+i) = 2fn{ii-i,ii). 

An optimal sequence of iiS should satisfy this for all i, so we wish to solve the recursion 
subject to ^0 = 1 with given i^. Let rj = If i + 1 < A; we have 2£f — i.e., 



For i ^ k -1 we have 2£^ = c£i-i£f^i so 

2ri = crl^i. 

When the ratios n satisfy these equations, we have, for all i < k, 

n = 2(rfcx/^/2)2'"\ 

Multiplying, we find for i < /c 

k 

h/ii = n = '^'-'{nv~c/2f-'-'/v~c. 
j=i+i 



In particular. 



so 



4 = 4/4 = 2'=(r,V^/2)2'=- VV^, 
2 /4v^^^ 



and for i < k, 



2K-1 



^^ = 2^^) . (7) 



On this optimal sequence i, we have 



Minimum bounded-depth/ diameter spanning and Steiner trees Angel, Flaxman, Sz Wilson 9 



2.4 Bounded-depth trees 

How do we relate this sequence to the greedy minimum bounded-depth spanning tree or 
Steiner tree? Let I denote this optimal sequence when 1^ = m and c = 2n/m. Let us greedily 
place nodes in level i for i < k, and then connect the remaining nodes to the last level 
of the tree. For fixed m/n, so long as < log2logn — uj{l), we have S> 1, so for each 

i < k we have Si -C n, so the expected weight of level i of the tree is (1 + o(l))/„( , ) 
for i < k, and is at most c/„([£j_i], for i = k. Furthermore, for each i > we have 

ii ^ 1, so the rounding to integers only causes a (1 + o(l)) multiplicative correction. Thus 
the expected weight of this greedy minimum spanning tree or Steiner tree is at most 



E[wt(7;-)] < (1 + o(l))/,,2n/mW = (1 + o(l))v/8m/n(l - 2" 



V2 



mn 



2k 



This is the upper bound of the in-expectation part of [Theorem LSI for bounded-depth Steiner 
trees when the edge weights are Exp(l) random variables. Concentration will follow when 
we prove the lower bound in § 



For example, when k = 2 and m = an, the best choice is ii ~ a^^'^n^^'^, yielding a total 
expected weight of about la^/^n^/^. 

We will also be interested in taking larger /c's, namely k = log2 logn -|- 0(1) and larger. 
If we simply substitute this k (or any larger k) into the estimate for the weight of the greedy 
tree, we would get E[wt(7^-)] ~ = 6(1). This estimate for E[wt(7^-)] is not valid, because 
for i = k — 0(1) we have Sj = G(n), so the ]E[PVj] are larger than the above formula gives. 
However, these E[iyj]'s are only a constant factor larger than predicted, so we still have 
E[wt(7})] = 9(1). 

Intuitively, all the choices that we made when constructing the greedy tree are close to 
optimal. We will see in the next section that it is in fact possible to build a better tree by 
making non-greedy choices when k is larger than logjlogn -|- a;(l) (the construction there 
makes use of this greedy tree, combining it with the optimal minimum spanning tree with 
unbounded depth). However, we will prove in the lower bound section that the weight of 
this greedy tree is within a factor of (1 -|- o(l)) of the weight of the optimal tree so long as 
k < log2 logn — ci;(l). 



2.5 Bounded-diameter trees 

Having proved the upper bound for wt( MST {Kn,m) ), we now consider the case of trees 

Vdepth< k / 

with a bounded diameter. As noted, the primary difference is that now there is no fixed root 
from which to measure distances. The following is an easy observation (which will be more 
useful for the lower bounds). 

Lemma 2.1. A tree with diameter 2k contains a unique root vertex, from which the tree 
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has depth k. A tree of diameter 2k + 1 contains a unique edge so that all vertices are within 
distance k of an endpoint of the edge. 

Proof. Take a path of maximal length in the tree, and take as the root vertex or edge the 
central vertex or edge of the path. The bounds on the depth follow from the maximality of 
the path. □ 



To prove the upper bound for E 
is also a tree of diameter 2k. Thus 

wt ( MST {K, 



wtf MST {Kn, 



\diam< 2k 



just note that any tree of depth k 



\diam< 2k 



< wtf MST {Kn, 

V depth < k 



m, 



To prove the upper bound for E 



wt ( MST 

Vdiam< 2k H 



we fix a root edge with small 



weight, then repeat the argument for the weight of the greedy tree, except that level now 
has size Cq = 2 rather than 1. There is an easy way to relate the optimal costs without 
repeating the optimization problem. 

The key is that the cost /n,c(^) is homogeneous in the sequence £. Consider the optimal 
sequence £ for spanning m/2 terminals among n/2 vertices, then 2^ is the optimal sequence 
for spanning m terminals among n vertices, except that it starts with two vertices at level 0. 
Thus we can repeat the greedy construction to find that 



E 



wtf MST {Kn,m] 

Vdiam< 2fc + 1 



< (1 + 0(l))/,,2n/™(2£) 
= {l + 0{l))fn/2,2n/m{i) 

= our upper bound on E 



wtf MST (ir„/2,m/2) 



V depth < k 



This is the upper bound of the in-expectation part of [Theorem 1.51 for bounded-diameter 
Steiner trees when the edge weights are Exp(l) random variables. 



3 Sliced-and-spliced tree 

To construct a small-diameter spanning tree with weight close to C(3)) the idea is to take 
the true (unconstrained) minimum spanning tree (or Steiner tree), break it apart into small 
subtrees which each still contain many vertices, and then splice the subtrees together using 



the greedy tree approach from |§ 2} The resulting tree is locally the same as the MST, so the 
weight is about the same, but globally it has been rewired to have much smaller diameter. 

Recall that the weight of the unconstrained Steiner tree is (1 — o(l))(m — l)/nlog(n/m) 
(w.h.p. and in expectation) when m <^n |BGRS04] . and that (using also Frieze's result on 
spanning trees [Fri85j ) consequently wt(MST(i^'„, m)) = Q{m/n\og{en/m)) for 2 < m < n. 
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Theorem 3.1. Suppose 2 < m < n and k = log2logn + A where A > 'n,/(mlog(en/m)), 
and the edge weights are exponential random variables with mean 1. Then 



E 



wtf MST {Kn,m)] - wt(MST(ir„,m)) 

\depth< k / 



< o 



'm\og{en/m) 
nA 



Proof. Each edge weight We is given by an exponential Exp(l) with mean 1. For < e < 1, 
we can write We = min(Wg, w'^) where w'^ and w" are independent exponentials with mean 
1/(1 — e) and 1/e. Let X = wt(MST(i^„, m)), i.e., the weight of the unconstrained minimum 
Steiner tree with the We edge weights. Let us take the unconstrained minimum spanning or 
Steiner tree T using the w'^ weights. The weight of this tree satisfies X < wt(T) = X/(l — e). 
The edge weights w'^ are independent of T, which will soon be useful. We will eventually 
choose < e < 1. Assuming e < 1/2, we have E[wt(r) - X] < 0{eE[X]). 

Any tree with diameter greater than A may be broken up into two trees with diameter 
> [A/2J by removing the middle edge of any path realizing the diameter. Let us repeatedly 
break up the Steiner tree T in this way until we are left with a collection of subtrees with 
diameters between [A/2J and A, of total weight < wt(T). 

We view these subtrees as meta-nodes. We will construct a spanning meta-tree of depth 
[logglognj on these meta-nodes, with the property that its edges are attached to a single 
vertex within each meta-node. This implies that the depth of the resulting tree (rooted at 
the marked vertex in the meta-root) is at most log2 logn + A. This is done using the greedy 
tree construction. We start with any subtree as the root meta-node, and mark an arbitrary 
vertex within it. When building a layer of the greedy tree, we only consider edges from the 
marked vertices of the previous layer of the tree, but these edges may connect to any vertex 
of a subtree not yet in the greedy tree. For each sub-tree there are at least [A/2j-|-l>A/2 
such edges, and each one has weight at most w'^ (here we use that the w'^ are independent of 
the spanning or Steiner tree). Thus the weight of the lightest connection between a marked 
vertex and a sub-tree is dominated by an Exponential with mean 2/(eA). When a subtree 
gets connected, the vertex by which it gets connected becomes the marked vertex. Let S 
denote the total weight of the splice edges within the greedy meta-tree. By our results from 
2} E[S] < 0(l/(eA)). 

The weight of the sliced-and-spliced tree is at most wt(T) + S. To optimize this bound, 
we pick e to minimize 0{eE[X]) + 0(l/(eA)), i.e., e = Q{1/^E[X]A), and since A > 
n/{mlog{en/m)) = B(1/E[X]) was one of our assumptions, we can pick such an e < 1/2. 

Thus, wt^ MST (Kn,m)^ is at most the sliced-and-spliced tree weight, which is at most 
X = wt(MST(i^„, m)) plus a quantity which in expectation is at most 6 ( y ) • □ 



The proof is wasteful in the separation of edges (or weights) into two independent com- 
ponents. If this is not done, then the edges between sub-trees are likely not to have come 
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from the MST, and so tend to be heavier. It seems that this should only give a constant 
factor to the cost of the meta-tree. 



Proof of Theorem I.4 for exponential weights. Using [Theorem 3TT1 ii k = logj log n+A where 
A > uj{n/{m\og{en/m))), then 



E 



wtf MST {K^,m)] - wt(MST(ir„,m)) 

\depth< k / 



< o(E[wt(MST(ir„,m))]), 



so the convergence in probability for bounded-depth Steiner trees is an immediate con- 
sequence of Markov's inequality, and convergence in expectation is also immediate. The 
bounded-diameter statements are a consequence of the bounded-depth statements. □ 



The case of other distributions is handled in 5 6 



4 Concentration of level weights 

Let Ui, . . . ,Up be a pool (set) of p i.i.d. exponential random variables, and for b < p, let 
Wb^p be the sum of the b best (smallest) UiS. The distribution of Wb^p plays a key role in 
the behavior of the weights of bounded-depth minimum spanning trees and bounded-depth 
Steiner trees. The total weight Wi of the edges connecting levels i — 1 and i in the greedy 



tree from § 2.1 is given by 

wt,(T,) = -J-We. 



We derive here some basic properties of Wb^p, including its expected value, and the probability 
that it deviates far from its expected value. 

Let Yi be the i^^ smallest of the f/j's. Since the minimum of independent exponentials is 
again an exponential, and since an exponential conditioned to be larger than some value is a 
translated exponential, we have Fj+i — Yi = ^-^j, where the Xj's are i.i.d. Exp(l) random 
variables (where by convention Yq = 0). It follows that 

b fe-l , . 

Ex ^ — I 

i=l i=0 ^ 

Thus 



i=0 ^ 



Let us approximate the expected value by 



— , . . , , f b\ b^ b^ b^ \<byp, 
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we have 



Lemma 4.1. For any 6 > and b < p, 



Pr 



< Pr [Wb,p < (1 - S)E[Wb,p]] < exp 

Proof. We use the method of bounded differences (see e.g., |McD89] ). For /? > 0, we have 

PT[Wb,p <x] = Piie-f^^"''' > e"^^] < e^^'E[e-^^''-''] 



6-1 



1=0 
6-1 



nr 



1=0 



+ (3{b-t)/ip-t) 



exp 



i=0 



— i 
p — i 



Because — log(l + u) < —u + for m > 0, we have 



Pr[iyfe^p < x] < exp 
< exp 

Letting x = (1 — 5)E[iyfc^p], we obtain 

FT[Wt,p < (1 - 6)E[Wh,p]] < exp 

and setting P = SK[Wb,p]p'^ /b^ , we obtain 

FT[Wb,p < (1 - 6)E[Wb,p]] < exp 

and since E[iyf„p] > ^^/(2p), we conclude 

FT[Wt,p < (1 - 5)E[Wfe,p]] < exp 



6-1 



i=0 



p — i 2 (p — iy 



3^b^ 

P(x-E[W,,p]) + ^-^ 



0^ b^' 

-P5E[W,,p] + ^- 

z, p ^ 



-mw,,pr^. 



-6' 



□ 



When bounding Pr[VI/fc^p < (1 — 6)b'^/{2p)], it is possible to get a constant of 3/8 rather 
than 1/8, and 3/8 is tight. But we also use Pr[l^fe^p < (1 — 5)E[l^feJ] in §_5, and in the end 
this constant does not affect the asymptotic lower bound that we prove there. 
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5 MST lower bounds 



5.1 Strategy 

For sets A and B of vertices, let F{A, B) be the minimal total weight of a set of edges 
connecting each vertex in B to some vertex in A. Note that F{A, B) is increasing in B and 
non-increasing in A. Define 



F(a,b) = min F(A,B) 

\A\<a,\B\>b 
AnB=0 



min F(A,B), 

\A\=a,\B\=b 
AnB=$ 



i.e., the minimal cost for connecting at least b vertices to at most a vertices. Next let 

k 



i=l 



this is a lower bound on the cost of any spanning tree of depth k whose level sizes are given 
by {io,ii, . . . ,ik)- (Mnemonically, the F's are random variables determined by the edge 



weights of the graph, and the /'s from § 2.3 are deterministic quantities which we argue are 
likely to closely approximate the F's.) We can obtain a sharper lower bound by treating the 
last level differently. In particular, for the last level of Steiner trees, we need only consider 
sets B which contain only terminal nodes of the Steiner tree. The sharper bound is then 



k-l 



where Fm{a, b) is defined as F(a, b) was, but with the set B restricted to be a subset of the 
m terminals of the Steiner tree. 

Let {io,ii, . . . ,ik) be the "greedy sequence" of level sizes that optimizes /n,2n/m(^) and 



which we used for the greedy tree in %_2 Our strategy is to show that Fm{i) is approximately 
minimized at Fm{i) for n large enough, and that Fm{i) is within a factor (l — S) of fn,2n/m{^)- 
It will follow that for n large enough, the weight of the greedy tree is close to the weight of 
the optimal tree. 

Lemma 5.1. For any 6 > 0, a & {1, . . . ,n} and 6 G {1, . . . ,n — a} we have 



Ft 



F{a,b) < (1 -5) 



2na 



< 



exp 



a log S b 

a 8 



Proof. Fix a set A of size at most a. For each x ^ A the minimal weight of an edge connecting 
a; to A is an independent Exp(l). Let Wa be the total weight of edges connecting b vertices 
to A. By ILemma 4.1t 



Pr 



Wa<{1- S ) 



2na 



< Pr 



Wa<{1- 6) 



2in~\A\)\A\ 



< exp 
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Finally, the number of sets A of size at most a is 



+ ••■ + 



and a union bound yields the claim. 

Lemma 5.2. For any 6 > 0, a G {1, . . . ,n} and b G {1, . . . ,m — a} we have 



□ 



Pr 



/ , N / / m — h , m 
FUa,b) < (1-5) 1 —log 



m — b J a 



1 - o(l) ifm/b ^ 1 



< exp 



1 ^X2l, 

a log 

a 8 



A^'ote that the bound on Fm(a, 6) is (1 — o(l))6^/(ma) m the limit S and b/m 



Proof. The proof is essentially the same proof used for ILemma 5.1\ except that we use the 
more precise estimate from ILemma 4.ll □ 

In other words, if b is large enough compared to a, then it is unlikely that F{a, b) is 
much smaller than fn{a,b) = b'^/{2na), and Fm{a,b) is unlikely to be much smaller than 
fn,2n/m{ci,b) = b"^ / (ma) . Let us define 



R&,n{o) 



16 ne 
— alog— . 
o-^ a 



Let 



fi^^'"\a, b) = lb>fi^ „(a) 



2na 



Then with high probability, for each a and b we have F{a,b) > fn^'^'"\a,b), indeed, the 
probability that this fails is at most 

1 1 1 2 _ 3 

(o) + (i)^(o) + (:) + G)^'"--^^" ^n(n-l)-n 

Let 



(Rs,n)i 



j— 1 ; j j 



E 



L^,>iJi,„(4_i), 



and f^c''^\tj be defined similarly, but with an extra factor of c in the k}"^ term of the sum. 

Corollary 5.3. With high probability (> 1 — 3 / n) , any spanning tree with level sizes given 
by (io, . . . has weight at least (1 — S)fi^^'"\i). If ik = {1 — o(l))m, then any Steiner 
tree connecting a given set of m terminals with level sizes {io, . . . ,ik) has weight at least 

{l-6-o{l))f^Xti^- 
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Thus we are done if we show that fnj {() constrained to X^j-^j > m is almost minimized 
at the sequence 1 = {Iq, . . . ,ik) that minimizes fn,c{() constrained to ik — m (and which we 
used in the greedy tree construction). 

Definition 5.4. We say that the k^^ level of a sequence i is large if -^fe > (1 — 5^/16)m. For 
t >1, we say that the {k — ty^ level is large if 

t 

4-*>^R^Jir~R2^(m)---). 

Since is monotone increasing in a up to a = n, the inverse function Rjj^{b) is well- 
defined for 1 < 6 < n. Since Rj^i^) < ((5^/16)6, and X^^^ > m, it follows that there must 
be at least one large level. 

We may enlarge the set of €s over which we are minimizing; so long as fi^^'"\i) is 
still almost minimized at i, we will have our desired lower bound. Naturally we relax the 
constraint ii E N to ii E M"*". We keep the constraint io = I- We shall drop the ^^ii > m 
constraint, and replace it with a constraint that there is a large level in the above sense, 
since this only increases the set of sequences that we are optimizing over. 

5.2 No small jumps 

Call a jump from ^i to large if > Rs^ni^i) and small otherwise. Suppose that a 
sequence contains a small jump If ii^i < then we may increase fj+i or 

decrease ii, and each term of fi^c'"\i) either stays the same or decreases. Thus we may 
assume that all small jumps are from a to i?5^„(a). If two consecutive jumps are large then the 
intermediate value must satisfy — We wish to show that a sequence achieving 

the minimum value of fi%^'"^ in fact has no small jumps, which will allow us to find the best 
sequence. We start by showing that it does not have a small jump followed by a large jump. 

Lemma 5.5. There is a 5o > so that whenever c > 1 and < 5 < 5o, and a sequence i 
has a small jump followed by a large jump {£i,£i^i), it is possible to change so as 

to reduce fif'"\£). 

Proof. Let — a and = b. Let C = 16/5^. We may assume £i — Rs^ni^i-i)-, since 
otherwise replacing £j with R§^n{U-i) reduces fi^c'"'\£). There is a shght difference when 
i + 1 — k, ets opposed to i -1-1 < k, since the last (k^^) summand of fi^c'"'\£) contains a factor 
of c. We deal below with the case i + 1 = k. The case i + 1 < k differs only in that c does 
not appear, and is derived by replacing all c's by I's. We consider two possible replacements 
e'i and £'^ defined by 



Rs,n(£i) — £i+l- 
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The contributions to fi^c'"'^ from the two jumps in the three cases are 

^2 cb^ cb^ 



2nii 2nR{a) 2nCa log (ne/a) ' 

p2 Q /^2l4\V3 



2nii-i ' "2ni'i 2n \ 4a 



U" 



2nli-i 

Since b = Rs,n{ii) = Ce'{\og{ne/i'{) > we have b > Ci'l\og{ne/b), so 

62 



U" < 



If U" > U then 
If U' > U, then 



2naC2 log(ne/6)2' 
Cc log (ne/6) 2 < log(ne/a). 



b<C'c-'/^alog{ne/af^\ 

where C" = (3^)3/2/2 = 96V3/6\ 

If both t/' > t/ and U" > U then we find 

log(ne/a) > Cc log^ 



C'c'~^/'^alog{ne/a) ' 
If we denote q = log(ne/a), this becomes 

q > Cc{q - log(C"c-V2g))2. 

Since b < m and each jump increases the level size by at least 16/^2^ have a < 
((57256)m, so q> log[(256e/(5^)n/m]. Since C = 96V3/5^ we have q - logC" > |logg - 
const. Thus 

16 , ^ 

g > -g — log q + - log c — const 



6 



- log g + ^ log c - const^ 



But g ^ 00 as 5 ^ and c > 1, so this equation cannot be true for small enough S. Thus, 
provided 5 < Sq, we have min{C/', U"} < U, so replacing li with either £[ or £" reduces 

Consider the maximal t such that level k — t ol the sequence £ is large. The jump from 
ik-t-i to ik-t must be a large jump, since otherwise level k — t — 1 would also be a large 
level. Because (for small enough S) there are no small jumps followed by large jumps, it 
follows that the subsequence £^,£1, . . . , £k_t consists only of large jumps. If t 7^ this would 
imply that the total cost up to level k — t is too high for £ to be optimal. Our next step is 
to obtain a lower bound on £k-t-i for which we need the following lemma. 
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Lemma 5.6. Assume S < 1. Recall that C — 16/5^; and that V is the gamma function 
(generalized factorial). If for some r > 2 



then 



Proof. If 



then 



(ClogC)'-ir(r)2' 

. en 

^'"^ ^ - (Clog cyv{r + iY' 



en 

a < 



(ClogC)T(r + l) 



2' 



en ne 
Rs,n{a) < Cj^TT-TW^W—r^ log ■ 



(C log CyT{r + 1)2 ^ en/((C log C)'T(r + 1)2) 
r log(r2ClogC) en 

^ ~r~ III X 



r2 1ogC (ClogC)'-ir(r)2' 
The first term is 

r log(/-^C'log C) 2Iogr logC log log C 

r2 1ogC rlogC rlogC rlogC 

and since (log r)/r < 1/e and (log log C)/ log C < 1/e and r > 2, we have 

2 1 1 

< \ \ . 

- elogC 2 2e 

As long as 5 < 1, we have C > 16, so that this quantity is bounded by 1. □ 

Thus we get a lower bound on the size ik-t of the first large level. 
Lemma 5.7. If 5 < 1 and t > 1 and level k — t is a large level, then 

f > 



{tlog^/d)m- 
Proof. By our definition of "large," 



h-t>Rl'ni---Rj,l({l-SVl6)m)...). 
Next we find the smallest r > 2 satisfying 
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the relevant r satisfies r < 0(log ^/loglog so we can bound 

ik-t > 



m > 

en 



(ClogC)^+T(r + t + 1)2 

which, if r > 2 (so that ([9]) is tight), can be bounded by 

(l-^Vl6)m 
- (ClogC(r + t)2)*- 

Whether or not r > 2, we can bound 

m 

Lemma 5.8. If < 6 < 6o, there is a constant A such that whenever k < logglogm — 
log2 log(en/m) — A, any sequence i optimizing f^2n/m(^) contains no small jumps. 

( R ' ) 

Proof. If t > 1, then we may bound fn,c'" (i) from below by the cost of the first k — t levels, 
which by our earlier calculation ([8]) is 



f^{io,...,h-t) = ^il-2'-'''"'-' 
n 



■yk-t 



Let us assume t < k. Upon substituting our lower bound for ik-t from ILemma SlTl and 
c = 2n/m, we may compare this (LHS) to the sequence from the greedy construction (RHS): 



]_ 

ok-t_ 



1 7 



2™ ^ W(>gS)^"' V "s^/?^a-2- 



n(|log^)®(*)' M 2^=-* 



T log — 



Bit) 




e(^^^ (logm-0(A;)) ^0(tlog0 +0(tloglog^) +0(log£). 

Let us assume k < logglogm — A, where A is a suitably large constant depending on 6. 
Then the 0{k) term in the LHS may be neglected, and the LHS is fi(2^2*), which is larger 
than the Q{tlogt) term on the RHS. If in addition, k < logglogm — logg log(en/m) — A, 
then half the LHS is also fi(2^2* log(n/m)), which dominates the second and third terms in 
the RHS. 

Thus under these conditions on k, any sequence i with t 7^ is not optimal, and so any 
optimizing sequence for /^^^/^(^) contains no small jumps. □ 
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5.3 Lower bounds 

Proof of lTheorem lower bounds, for exponential weights. We start with the bound on 
wt( MST (Kn,m)]. Combining Corollary 5.3 with ILemma 5.8l we find that w.h.p. the 



\diam< 2k 



cost of a Steiner tree with level sizes given by i is at least (1 — f^^^^jl^{i) , and that this 



is minimized by the sequence constructed in |§ 2| to give the upper bound. Thus we find the 
upper bound is tight. 

As noted, wt( MST {K„,ni) ) > wt( MST {K„,ni) ), so its bound is also tight. 

Vdepth< / Vdiam< 2fc / 

Finally substituting m/2 and n/2 throughout, we find that no tree can improve by more 
than (1 + 0(1)) on the greedy tree construction for the odd diameter case. □ 

Proof of \Theorem concentration, for exponential weights. For each of the random vari- 
ables wt( MST (Kn,m)], wt( MST (Kn,m)], and wt( MST (Kn,m)], the random 

Vdiam< 2k J Vdiam< 2k + 1 J Vdepth< k / 

variable is almost never smaller than a factor of 1 + o(l) smaller than our upper bound on 
their expected values. It follows that for each of these random variables, the expected value 
is within a factor of 1 + o(l) of our upper bound on it, and that these random variables are 
with high probability within a factor of 1 + o(l) of their expected values. □ 



6 Other weight distributions 

In the proofs up to this point, we assumed that the edge weights of Kn are distributed 
according to an exponential random variable with mean 1. In this section we prove the 
parts of Theorems 11.11 11.41 and 11.51 that pertain to more general weight distributions. In 
our notation up until now we suppressed the weight distribution, but here we make it more 
explicit: we let denote the complete graph where each edge weight is an i.i.d. copy of a 
non-negative random variable W . 

The key observation (which was made earlier in the context of unconstrained minimum 
spannmg trees [FYi85l ISte87] and Steiner trees |BGRS04j ) is that w.h.p. only edges with 
weights o(l) are ever used (except when the depth is 1). Thus, it is principally the density 
of the distribution near that is significant. In this section we assume that the edge weights 
are i.i.d., and are distributed according to some non-negative random variable W that has 
density 1 near 0, i.e., for positive t near 0, 

Pr[l^? < t\ =t + o{t). 

(If the density near exists and is not 1, then linearity in the weights gives a multiplicative 
constant in the theorems.) We let W denote an exponential random variable with mean 1, 



W ~ Exp(l), 
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and for £ > define 



r "'-^ and W' = r ""'^^ 

£ ly > e, \oo W > e. 



For a generally distributed non-negative weight W with density 1 at 0, for any 5 > there 
is an e > such that 

{l-5)We ^ {l + 5)W', 

i.e., W is stochastically sandwiched between (1 — 5)We and (1 + 5)W^ . Since wt(MST) is 
monotone in the edge weights, it follows that bounded-depth/diameter tree weight distribu- 
tions are also stochastically sandwiched. 

Let us call an edge of the weighted graph e-light if its weight is at most e, and otherwise 
let us call it e-heavy. The idea is to show that w.h.p. the greedy, sliced-and-spliced, and 
optimal trees use only light edges (when A; > 2), so that it makes little difference whether 
the edge weights are distributed according to W or W . 

As we shall see, the density- 1- at- assumption is enough to get convergence in probability, 
but some additional assumption to rule out the possibility of very fat tails in the distribution 
is required to get the upper bounds for the convergence in expectation of the tree weights. 
We shall assume 

E[iy] < oo 

when deriving convergence in expectation. 
6.1 Upper bounds 

We start by proving that, in the greedy tree and sliced-and-spliced tree, heavy edges are 
rare. 

Lemma 6.1. If 2 < k < ^loggTi, the expected number of e-heavy edges contained within the 



greedy Steiner tree from ^ 2 is at most exp[6(logn) — G(min(£:, l)n)] + knexp[—en^^ 



Proof. The level sizes from ([7]) are monotone increasing in i, and monotone decreasing 
in k. At level 1, since k >2 we have 



< 2^2nm/A'' < ii < 2^nm/2 < 2n^l^ . 

The number of light edges emanating from the root is a binomial with parameters n — 1 and 
•p = \ — e^^ > e — 12. A standard large-deviation formula (see |McD89t Eqn. 5.6]) tells us 
that for any binomial random variable D, 



Vx\D < E[D]/2] < e 



~E[D]/8 
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Assuming e > 5n and n is large, so that E[Z}]/2 > 2n^/^ + l, we deduce that the expected 
number of heavy edges in the first level of the greedy tree is at most 

[2^2/3] e-("-i)p/8 = exp[e(logn) - Q{pn)]. 

(If e < 5n^^/3 or n is not large, the conclusions of the lemma are trivially true.) For any 
subsequent level of the tree, the number of heavy edges is at most the number of vertices 
not connected to it via a light edge, and since there are at least (n/A'^Y^^ vertices in the 
previous level, the expected number such vertices not reachable by a light edge is at most 
n{l — = n exp[— e(n/4'^)^/^] < ra exp[— en^/^]. Upon multiplying by A; — 1 (since 

there are k — 1 levels after the first) and adding the heavy edges from the first level, we 
obtain the desired bound. □ 

Proof of \Theorem 1.51 upper bounds, other weight distributions. The upper bounds for con- 
vergence in probability are an immediate consequence of the fact that for fixed e, w.h.p. there 
are not any heavy edges. The upper bounds for convergence in expectation follow from the 
fact the expected number of heavy edges is o(l), and the fact that K[W] is finite. □ 

The following lemma essentially appears in |BGRS04] . 

Lemma 6.2. For n> 3 and e > 0, the expected number of e -heavy edges in MST(i^'„,m) is 
at most 0(e~^"n^log n). 

Proof. Consider any edge of the Steiner tree MST{Kn, m) with weight greater than e. If its 
endpoints are connected by a path with total weight < e, then we could delete the heavy 
edge and replace it with a portion of or all of the low-weight path connecting that edge's 
endpoints, obtaining a lighter Steiner tree. (This argument appeared in |BGRS04] .) Janson 
proved that in the complete graph Kn with exponential edge weights, w.h.p. every pair of 
vertices is connected by a path of weight at most (3 + o(l))n~^ logri | Jan99] . In fact, it 
follows from |Jan99l Eqn. 2.8] that, when n > 3 and e > 0, the expected number of pairs 
of vertices not connected by a path of weight < e is at most 0{e~^'^n^\og'^ n). Thus, the 
expected number of heavy edges in MST(i^'„, m) is at most 0(e~^"n^ log^ n). □ 

Lemma 6.3. If k = logglogn-f- A where A > n/{mlog{en/m)), the expected number of 
e-heavy edges in the sliced- and- spliced Steiner tree /rom |§ 3| is at most exp[G(logra) — en] + 
0{l/ieVA)). 

Proof. There could be as many as exp[B(logn) — en] heavy edges in the starting Steiner 
tree MST(iir„,m). Of course, when we do the shcing of MST(i^„,m), no heavy edges are 
introduced, but heavy edges could be introduced when we splice the subtrees using the 
greedy-tree construction. In the construction, recall that the total weight of the splice edges 
was in expectation at most 

o y^^^^^ < oii/VA). 

The expected number of e-heavy splice edges can be at most 1/e times as large as this. □ 
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Proof of \Theorem 1.4\ other weight distributions. As above, the upper bounds for conver- 
gence in probability are an immediate consequence of the fact that for fixed e, w.h.p. there 
are not any heavy edges, and the upper bounds for convergence in expectation follow from 
the fact the expected number of heavy edges is o(l), and the fact that E,[W] is finite. 

The lower bounds follow from the fact that the unrestricted Steiner tree MST {Kn, m) 
w.h.p. has no heavy edges, and is at most as heavy as the bounded-depth/diameter Steiner 
trees. □ 



6.2 Lower bounds 

Proof of \Theorem lower bounds, other weight distributions. Fix some 5 > 0. Let b) 
be defined as F{a,b) but using {We)£. We argue that w.h.p., for every pair a < b, we have 
either Fe{a,b) > (1 — 6)F{a,b) or Fe{a,b) > ^Jn. Thus, the cost of a tree with given level 
sizes is either within (1 — 5) of the unmodified cost, or else is at least \fn. Since the optimal 
choice is smaller than ^Jn (here we use > 1), the proof of the lower bound carries over 
unchanged. 

We consider the graph of light edges, which is Gn,'p with p = e + o{e) . If every set A of 
size |y4| = a has at least b neighbors in the light-edge graph, then F^ifi,^ b) = F{a, b). (To see 
this, consider the sets A and B for which \A\ = a, \B\ = b, and Fi;{A, B) = Fi;{a, b). If there 
were a heavy edge from A to B, then we could delete the endpoint of that edge from B, 
and replace it with a vertex not already in B which is connected to A via a light edge, and 
^^(A, modified B) < Fi;{A,B), a contradiction. Hence there is no heavy edge from A to B, 
so F(a, b) < F{A, B) = F,(A, B) = F,{a, b) < F{a, b).) 

We consider several cases. 



Case b < en /A: For any vertex, its degree D in the light-edge graph is a binomial dis- 
tribution with parameters n — 1 and p = (1 + o{l))e. Since E,[D] = (1 + o{l))ne, the 
standard large-deviation formula (see |McD89[ Eqn. 5.6]) that we used earlier tells us that 
Ft[D < en/2] < e-(i+°(i))Ep]/8 ^ g-n£/(8+o(i))_ ^ ^^^^^ bound then tells us that w.h.p. the 

minimal degree is at least en/2. Conditional on this event, any set A has at least en/2 — \A\ 
neighbors, so if a < 6 < en/ 4, it follows that F^{a, b) = F{a, b). 

Case a > n^^^, b < n~ ra^/^: We argue that any disjoint sets A and C of sizes at least n^^^ 
and n^^^ have an edge between them. This is a union bound over all pairs of sets: the number 
of pairs of sets is at most 3", but each pair has no edge with probability (1 — p)'^^^^^^ . This 
implies that, for n large enough, w.h.p. -Fe(a, b) = F{a, b) for any a > n^^^ and b < n — n^/^, 
since any such set A has at most n^/^ non-neighboring vertices in the light-edge graph. 



Minimum bounded-depth/ diameter spanning and Steiner trees 



Angel, Flaxman, & Wilson 



24 



Case a > n^^^, b > n — n^^'^: By the monotonicity of F and F^, and the above case, for 
large enough n we have w.h.p. 

F(a, b) > Fe{a, b) > Fe{a, n - n^^^) = F{a, n - n^'^). 

However, based on our bounds on F from Lemmas 15.11 and I5.2[ we have 

F(a,n-n^/^) > (1 -5)F(a,6) 

for any 5 given n large enough. 



Case a < n^^^, b > en/A: By the monotonicity of F and F^, and the second case above, 
for large enough n we have w.h.p. 

Fe{a,b) > F^{n^'^,en/A) = F(n^/^er^/4). 

By lLemma 5.11 w.h.p. this is at least 

,^ ,,£2^2/16 ^ 3 

^^-^)^»v^»2^^' 
i.e., it exceeds the weight of the greedy spanning tree. □ 



7 Open problems 

We identified a sharp cutoff of depth log2logn ± G(l) above which the minimum bounded- 
depth spanning tree has weight that is asymptotically equal to the value of the unconstrained 
minimum spanning tree, and below which it is much larger. This same cutoff at log2 log n ± 
G(l) holds for minimum bounded-depth Steiner trees with m terminals when m = B(n), but 
we do not know the location of the cutoff (or indeed if there is one) when m is much smaller 
than n. If there is a cutoff, we know that it occurs when the depth k is in the interval 



log2 logm — log2 log(en/m) — uj{l) < k < log2 logn + uj 



n 



m \og[en/m) 



but we do not know where in the interval. It would be interesting to better understand the 
weights of bounded-depth Steiner trees for these parameter values. 

It would be interesting to understand better the large-n behavior of the weight of the 
bounded-depth MST near depth log2 log?7,-|-A as a function of A. The precise behavior could 
be complicated, and is perhaps a periodic function of the fractional part of log2logr;,, but 
there are more basic open problems. For example, our construction in § 3| shows that when 



the depth bound is log2logn + A, the bounded-depth MST has weight < C(3) + 0(l/v A), 
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while our best lower bound is C(3)- We do not know how fast the approach to ({3) is when 
A is increased, or indeed, if ^(3) is reached for some finite A. 

The weight of the minimum weight Steiner tree (with unbounded depth), as a function 
oi a = m/n (the ratio of the number of terminals to the number of vertices) goes from at 
a = to C(3) at a = 1. As mentioned in |BGRS04"] . it would be interesting to understand 
how the weight varies from to C(3) for intermediate values of a. 

There was an experimental study aimed at sharpening our estimate of (1 — o(l))|n^/^ for 
the asymptotic weight of minimum bounded-depth spanning trees when with depth bound 
k = 2 jBBB+08j . suggesting |n^/^ — const. This constant will depend on the weight distri- 
bution; it may be interesting to rigorously determine the constant. 
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